PCX 



WORLD INTCLUECTUAL PROPERTY ORGANIZATION 
International Bareui 




INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patoit Classification ^ : 
H04N S/775 



Al 



(U) International Publication Number: WO 00/07368 

(43) International Publication Date: 10 Febfuaiy 2000^1 0.02.00) 



(21) Intematiooal Application Numben PCrA;S99/04894 

(22) Inttfnational Filing Date: 4 March 1999 (04.03.99) 



(30) PHority Data: 
09/126.071 



30 July 1998 (30X)7.98) 



US 



(71) AppUcant: TIVO, INC. fUSAJS]; 894 Ross Drive, Sunnyvale. 

CA 94089 (US). 

(72) Inventors: BARTON, James. M.; 101 Sund Avenue. Los 

Gatos. CA 95030 (US). NfcINNIS. Roderick. James; 1299 
Canton Drive. Milpitas. CA 9S03S (US). MOSKOWnZ 
Alan. S.; 300 Third Street. San Pnuicisco. CA 94107 (US). 
GOODMAN, Andrew. Martin; 2171 Avy Avenue. Menlo 
Parte CA 94025 (US). CHOW. Ching. Tong; 920 Seville 
Place. Fiemont. CA 94539 (US). KAO, Jean. Swey; 21876 
Meadow View Lane. Cupertino. CA 95014 (US). 

(74) Agents: GLENN, Michael, A. et al.; Law Offices of Michael 
A. Glenn. 125 Lake Road. Portota Valley, CA 94028 (US). 



(81) Dedgnated States: AL, AM, AT. AU, AZ. BA. BB. BG, BR, 
BY, CA, CH, CN, CU. CZ, DE. DK, EE, ES. FI, GB, GD. 
GE, GH, GM, HR, HU, tD, IL. IN, IS, JP, KE, KG. KP, 
KR, KZ, LC, LK, LR, LS, LT. LU, LV. MD, MG, MK, 
MN, MW. MX, NO, NZ, PL. PT, RO, RU. SD, SE, SG, 
SI SK, SU TJ, TM, TR, TT, UA, UG. U2, VN, YU, ZW, 
ARIPO patent (GH. GM. KE, LS, MW. SD, SL, SZ, UG. 
ZW), Eurasian patent (AM, AZ, BY. KG, KZ, MD. RU. TJ. 
TM), European patent (AT, BE. CH, CY. DE. DK, ES. FI, 
FR, GB, GR. IE, IT. LU, MC. NL. PT, SE), GAP! patent 
(BF, BJ. CF, CG. a. CM. GA, GN, GW, ML. MR, NE, 
SN. TD, TO). 



Publldied 

With international search report. 



(54) Title: MULTIMEDL\ TIME WARPING SYSTEM 



Input 
streams 




(57) Abstract 

The invention allows the user to store selected television broadcast programs while the user is simultaneously watching or reviewing 
another program. A prefeired embodiment of the invention accepts television (TV) input sueams in a multitude of analog and digital forms. 
The TV streams ar& converted to (MPEG) formatted stream for internal transfer axtd manipulation and are paned and separated it into video 
and audio components. The components are stored in temporary buffers. Events are recorded that indicate the type of component that has 
been found, where it is located, and when it occurred. The program logic is notified that an event has occurred and the data is extracted 
from the buffers. The parser and event buffer decouple the CPU from having to parse the MPEG stream and from the real time nature of 
the data streams which allows for slower CPU and bus speeds and translate to lower system costs. The video and audio components are 
stored on a storage device and when the program is requested for display, the video and audio components are exo^scted from the storage 
device and reassembled into an MPEG stream which is sent to a decoder. The decoder converts die MPEG stream into TV output signals 
and delivers the TV output signals to a TV receiver. User control commands aie accepted and sent through the system. These commands 
affect the flow of said MPEG stream and allow die user to view stored programs with at least the following ftmctions: reverae. fast forward, 
play, pause, index, fast/slow reverse play, and fast/slow play. 
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wo0(m)7368 pct/usw/04894 
Multimedia Time Warping System 

BACKGROUND QF THE INVENTION 



TECHNICAL FIELD 

The invention relates to the time shifting of television broadcast signals. More 
particularly, the invention relates to the real time capture, storage, and display of 
television broadcast signals. 

DESCRIPTION OF THE PRIOR ART 

The Video Cassette Recorder (VCR) has changed the lives of television (TV) 
viewers throughout the world. The VCR has offered viewers the flexibility to time- 
shifl TV programs to match their lifestyles. 

The viewer stores TV programs onto magnetic tape using the VCR. The VCR 
gives the viewer the ability to play, rewind, fast fonvard and pause the stored 
program material. These functions enable the viewer to pause the program 
playback whenever he desires, fast fonA^ard through unwanted program material or 
commercials, and to replay favorite scenes. However, a VCR cannot both capture 
and play back information at the same time. 

One approach to solving this problem is to use several VCRs. For example, if two 
video tape recorders are available, it might be possible to Ping-Pong between the 
two. In this case, the first recorder is started at tiie beginning of the program of 
interest. If the viewer wishes to rewind the broadcast, the second recorder begins 
recording, while the first recorder is halted, rewound to the appropriate place, and 
playback initiated. However, at least a third video tape recorder is required if the 
viewer wishes to fast fon/vard to some point in time after the initial rewind was 
requested. In ttiis case, the third recorder starts recording the broadcast stream while 
the second is halted and rewound to the appropriate position. Continuing this 
exercise, one can quickly see that the equipment becomes unwieldy, unreliable, 
expensive, and hard to operate, while never supporting all desired functions. In 
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addition, tapes are of finite length, and may potentially end at inconvenient times, 
drastically lowering the value of the solution. 

The use of digital computer systems to solve this problem has been suggested. 
U.S. Pat. No. 5.371.551 issued to Logan etal .. on 6 December 1994, teaches a 
method for concurrent video recording and playback. It presents a microprocessor 
controlled broadcast and playback device. Said devtee compresses and stores 
video data onto a hard disk. However, this approach is difficult to implement 
because the processor requirements for keeping up with the high video rates makes 
the device expensive and problematic. The microprocessor must be extremely 
fast to keep up with the incoming and outgoing video data. 

It would be advantageous to provide a multimedia time warping system that gives 
the user the ability to simultaneously record and play back TV broadcast programs. 
It would further be advantageous to provide a multimedia time warping system that 
utilizes an approach that decouples the microprocessor from the high video data 
rates, thereby reducing the mfcroprocessor and system requirements which are at a 
premium. 

ff^ifiiMARY OF THE INVENTION 

The invention provides a multimedia time warping system. The invention utilizes an 
easily manipulated, low cost multimedia storage and display system that allows the 
user to view a television broadcast program with the option of instantly reviewing 
previous scenes within the program. In addition, the invention allows the user to 
store selected television broadcast programs while the user is simultaneously 
watching or reviewing another program. 

A preferred embodiment of the invention accepts television (TV) input streams in a 
multitude of forms, for example, analog forms such as National Television Standards 
Committee (NTSC) or PAL broadcast, and digital forms such as Digital Satellite 
System (DSS). Digital Broadcast Services (DBS), or Advanced Television 
Standards Committee (ATSC). Analog TV streams are converted to an Moving 
Pictures Experts Group (MPEG) fonnatted stream for internal transfer and 
manipulation, while pre-formatted MPEG streams are extracted from the digital TV 
signal and presented in a similar format to encoded analog streams. 
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u. KiiPPf^ Stream and separates rt into its video and 

::.r::r;rrr~ss^-i-— ■ 

^u,«a and the daa IS «»aaed *om W bufl«. 

: -"-t"": aXlaTemS^o an MPEG stream. The MPEG 
^Tcf rode, r-etde, conve«= ^ MPEG strean. into TV 

r:^3:rdr^;eTvoo.p.^a.to,Tv,ece^. 

„Hc «r6 accepted and sent through the system. These 
user control commands are accepte ^.^^ ^^^^^ 

^..as./slaw«ve,eepta,,andtes.(aowptey. 

. nf the invention will become apparent from the 

» hinh level view of a prefen-ed embodiment 
Fig 1 is a block schematic diagram of a htgh level view 

of the invention according to the inventK>n; 
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Fig 4 is a block schema* diagram o. a parser and four direct memory access 
(DMA) input engirds oornair^dh the Media Sv«Kh according to lhe«en»^^ ^ 

Fig. 5 IS a schematic diagram of the components of a packetized elementary stream 
(PES) buffer acoorxllng to the invention; 

Fig 6 is a schematic diagram of the construction of a PES buffer from the parsed 
components in the Media Switch output circular buffers; 

Rg. 7 is a block schematic diagram of the Media Switch and the various components 
that it communicates with according to the invention; 

Fig. 8 is a block schematic diagram of a high level view of the program logic 
according to the inventton; 

F.g. 9 is a block schematic diagram of a dass hierarchy of the program togic according 
to the invention; 

Fig 10 is a block schematic diagram of a preferred embodiment of the clip cache 
component of the invention according to the invention: 

ng 11 isablock schematic diagram ofapreferred embodiment oftheinventtonthat 
emulates a broadcast studio video mixer according to the im«nt«n; 

Fig. 12 is a block schematic diagram of a closed caption parser according to the 
invention; and 

Fig 13 is a bkx=k schematic diagram ot a Ngh level view of a P™*-^.'"*"*™^ 
?thl ir^ntlon ut«lzing a VCR as an »neg«l componem Of the ,n«nton a«o,* 

to the inventton. 

The invention is embodied in a multimedia time warping system. A system 
rocolg lo the invention provides a munimedia -age and ^^^^^^ 
allows the user to view a television broadcast program with the option of instantly 
^wi^g prelus scenes within the program. The invention addHlonally pro^des 
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the user wKh the abWty to store selected television broadcast programs while 
^muZell watching or renewing another program and to view stored prog«n« 
r^tle Allowing lonctlons: reverse, fas. forward, play, pause. ,nde* 
fast/slow reverse play, and last/slow play. 

Heferrlna to Fig 1 , a preferred emt»diment of th. invention has an Input SecHon 
rZia slh ,02, and an Output Section ,03. The Input Section ,0,J-«» 
e°Lion (TV) input streams in a multKud. of fom,s, for example Nafonal 
'■^n Sta^a^a Comm«,e (NTSC, or PAt broadcast, -d d^~ ^^^^^^ 
as Digilal SateKlte System (DSS). Digital Broadcast Sendees (DBS), or Advanced 
Television Standards CommMee (ATSC). DBS, DSS and ATSC a'e based on 
Tarda ds called Moving Piclures Experts Group 2 ("f^f j^"^^^ 
?mnsLrt MPEQ2 Transport is a standaid for tom^ the digital data sl.«mfiom 
rrsJe.,ansmmersomataTVreceiv.rc«.<....s«nbleth.i.|PUt^^^ 

Td prArams in the multiplexed signal. The Input Section ,01 ^-^^^^^ 
ZU An MPEQ2 transport multiplex supports multiple programs ,n the same 
r^^Jl with muS;. v^eo and aud^ feeds and pr.a.e dat. The Inp^ 
Section ,0, tunes the channel to a particular program, extracts a specific MPEG 

out of it and feeds itto the rest of ,he system. Analog TV signal, are 
'eSil .m«arMPEGtom«tusingsepar«sv«eoa,^audloenc«ls«.s^ 

remainder of «,e system Is unaware of how the signal was obtam*,. 
^^fortion mayb. modulated NO the vertical BtoWng Inteival (VBI) of me anal^ 
wZ^h,anUerofs.».da,dw.y.;forexamp.e,^e North A^ncanBn^^^ 

T^txt Standwd (NABTS) may be used to modulate infomiation o o lines ,o 
iZoh 20 of an f^TSC signal, while the FCC mandates the use of Ime 2, for 
OlZ Cain (CO and Extended Data Services (EDS). Such signals are 
dl^d by 1 ilpu. secaon and passed to «,e om.. section, a. if «tey were 
delivered via an MPEQ2 privale data channel. 

The Media switch ,02 mediates between a microprceessor CPU ,06 hard disk or 
IZrdevice ,0S. and memory t04. Input streams are converted to an MPEG 
and sent fo the Media Switch t02. The Media S^ ,02 buffers *e 
MreG stream into memory, it then pertomis two operations if the user is watching 
reafLT-?V: the stream is sent to the C^tpu. Section ,03 and « .. wntten 
simultaneously to the hard disk or storage device ,05. 
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The Output Section 103 takes MPEG streams as input and produces an analog TV 
signal according to the NTSC. PAL. or other required TV standards. The Output 
section 103 contains an MPEG decoder. On-Screen Display (OSD) generator 
analog TV encoder and audio logic. The OSD generator allows the program log.c to 
supply images which will be overlayed on top of the resulting analog TV signal. 
Additionally, the Output Section can modulate infomiation supplied by the program 
logic onto the VBI of the output signal in a number of standard fomnats. including 
NABTS, CO and EDS. 

Wrth respect to Fig. 2. the invention easily expands to accommodate multiple Input 
Sections (tuners) 201 . 202. 203. 204. each can be tuned to different types of inpirt. 
M^ Output^/'odules (decoders) 206. 207. 208. 209 are added as we . 
Special effects such as picture in a picture can be implemented with multiple 
decoders The Media Switch 205 records one program while the user Is watching 
another. This means that a stream can be extracted off the disk while another stream 
is being stored onto the disk. 

Referring to Fig. 3. the Incoming MPEG stream 301 has interleaved video 302 3C^_ 
306 and audio 303. 304. 307 segments. These elements must be separated and 
recombined to create separate video 308 and audio 309 streams or buffers. This is 
Tesstry because separate decoders are used to convert MPEG elements^c^ 
into audio or video analog components. Such separate delivery requires that time 
sequence infomiation be generated so that the decoders may be properly 
synchronized for accurate playback of the signal 

The Media Switch enables the program logic to associate proper time sequen^ 
infomiation with each segment, possibly embedding it directly Into the st^am. The 
time sequence infomiation for each segment is called a time stamp. These tmie 
stamps are monotonicalh. increasing and start at zero each time the system b<x.ts 
up This allows the invention to find any particular spot in any particular video 
segment. For example, if the system needs to read five seconds into anjncom.ng 
contiguous video stream that is being cached, the system simply has to start reading 
forward into the stream and took for the appropriate time stamp. 

A binaor search can be perfomied on a stored file to index into a stream. Each 
stream is stored as a sequence of fixed-size segments enab ing fast b nary 
searches because of the unifom. time stamping. If the user wants to start in the 
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middle of the program, the system perfomis a tiimry search of the stored segments 
until it finds the appropriate spot, obtaining the desired results with a minimal amount 
of information. If the signal were instead stored as an MPEG stream, it would W 
necessary to linearty parse the stream from the beginning to find the desired location. 

With respect to Fig. 4. the Media Switch contains four input Direct Memory Access 
(DMA) engines 402. 403. 404. 405 each DMA engine has an associated buffer 
410 411 412 413. Conceptually, each DMA engine has a pointer 406. a limit for 
that'pointer 407. a next pointer 408. and a limit for the next pointer 409. Each DMA 
engine is dedicated to a partcular type of infomiation. for example, video 402. aud^ 
4(» and parsed events 405. The buffers 41 0.411. 41 2. 41 3 are circular and collect 
the 'specific infomiation. The DMA engine increments the pointer 406 into the 
associated buffer until it reaches the limit 407 and then loads the next pointer 408 
and limit 409. Setting the pointer 406 and next pointer 408 to the same value, 
along with the corresponding limit value creates a circular buffer. The next pointer 
408 can be set to a different address to provide vector DMA. 

The input stream flows through a parser 401 . The parser 401 parses the stream 
looking for MPEG distinguished events indicating the start of video, audio or pnvate 
data segments. For example, when the parser 401 finds a video event, it directs 
the stream to the video DMA engine 402. The parser 401 buffers up data and 
DMAS it into the video buffer 410 through the video DMA engine 402. At the same 
time the parser 401 directs an event to the event DMA engine 405 which 
generates an event into the event buffer 413. When the parser 401 sees an audio 
event, it redirects the byte stream to the audio DMA engine 403 and generates an 
event into the event buffer 413. Similarly, when the parser 401 sees a private data 
event it directs the byte stream to the private data DMA engine 404 and directs an 
event to the event buffer 413. The Media Switch notifies the program logic via an 
intemjpt mechanism when events are placed in the event buffer. 

Referring to Figs. 4 and 5. the event buffer 413 is filled by the pa^er 401 wrth 
events Each event 501 in the event buffer has an offset 502. event type 503. and 
time stamp field 504. The parser 401 provides the type and offset of each even 
as it is placed into the buffer. For example, when an audio event occurs, the event 
type field is set to an audio event and the offset indicates the location in the audio 
buffer411. The program logic knows where the audio buffer 411 startsandadds 
tt,e offset to find the event in the stream. The address offset 502 tells the program 
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logic Where the next event occuned. but not where it ended. The previous event is 
cached so the end of the current event can be found as well as the length of the 
segment. 

With respect to Figs. 5 and 6. the program logic reads accumulated events in the 
event buffer 602 when it is interrupted by the Media Switch 601 . From these 
events the program logic generates a sequence of logical segments 603 which 
correspond to the parsed MPEG segments 615. The program logic converts the 
offset 502 Into the actual address 610 of each segment, and records the event 
length 609 using the last cached event. If the stream was produced by encoding an 
analog signal. It will not contain Program Time Stamp (PtS) values, which are used 
by the decoders to properly present the resulting output. Thus, the program logic 
uses the generated time stamp 504 to calculate a simulated PTS for each segment 
and places that into the logical segment time stamp 607. In the case of a digital TV 
stream, PTS values are already encoded in the stream. The program logic extracts 
this information and places ft in the logical segment time stamp 607. 

The program logic continues collecting logical segments 603 until it reaches the fixed 
buffer size. When this occurs, the program logic generates a new buffer, called a 
Packetized Elementary Stream (PES) 605 buffer containing these logical segments 
603 in order, plus ancillary control infomnation. Each logical segment points 604 
directly to the circular buffer. e.s.. the video buffer 613. filled by the Media Switch 
601 . This new buffer is then passed to other logic components, which may further 
process the stream in the buffer in some way, such as presenting it for decoding or 
writing it to the storage media. Thus, the MPEG data is not copied from one location 
in memory to another by the processor. This results in a more cost effective design 
since lower memory bandwidth and processor bandwidth is required. 

A unique feature of the MPEG stream transformation into PES buffers is that the data 
associated with logical segments need not be present in the buffer itself, as 
presented above. When a PES buffer is written to storage, these logical segments 
are written to the storage medium in the logical order in which they appear. This has 
the effect of gathering components of the stream, whether they be in the video, 
audio or private data circular buffers, into a single linear buffer of stream data on the 
storage medium. The buffer is read back from the storage medium with a single 
transfer from the storage media, and the logical segment infonnation is updated to 
conespond with the actual locations in the buffer 606. Higher level program logic is 
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unaware of this transformation, since it handles only the logical segments, thus stream 
data is easily managed without requiring that the data ever be copied between 
locations in DRAM by the CPU. 

A unique aspect of the Media Switch is the ability to handle high data rates 
effectively and inexpensively. It perfomis the functions of taking video and audio 
data in, sending video and audio data out, sending video and audio data to disi<, and 
extracting video and audio data from the disk on a tow cost platform. Generally, the 
Media Switoh runs asynchronously and autonomously with the mteroprocessor CPU. 
using its DMA capabilities to move large quantities of information with minimal 
intervention by the CPU. 

Referring to Fig. 7, the input side of the Media Switch 701 is connected to an MPEG 
encoder 703. There are also circuits specific to MPEG audio 704 and vertical 
blanking interval (VBI) data 702 feeding into the Media Switch 701 . If a digftal TV 
signal is being processed instead, the MPEG encoder 703 is replaced with an 
MPEG2 Transport Demultiplexer, and the MPEG audio encoder 704 and VBI 
decoder 702 are deleted. The demultiplexer multiplexes the extracted audio, 
vWeo and private data channel streams through the video input Media Switch port. 

The parser 705 parses the input data streatn from the MPEG encoder 703, audio 
encoder 704 and VBI decoder 702. or from the transport demultiplexer in the case 
of a digital TV stream. The parser 705 detects the beginning of all of the important 
events in a video or audio stream, the start of all of the frames, the start of sequence 
headers - all of the pieces of infomfiation that the program logic needs to know about 
in order to both properly play back and perfonn special effects on the stream, e.g. 
fast fonward, reverse, play, pause, fast/slow play, indexing, and fast/slow reverse 
play. 

The parser 705 places tags 707 into the FIFO 706 when it identifies video or audio 
segments, or is given private data. The DMA 709 controls when these tags are 
taken out. The tags 707 and the DMA addresses of the segments are placed into 
the event queue 708. The frame type infonnation, whether it is a start of a video 
l-frame. video B-frame, video P-frame, video PES, audto PES, a sequence header, 
an audio frame, or private data packet, is placed into the event queue 708 along with 
the offset in the related circular buffer where the piece of infomnatfon was placed. The 
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program logic operating in the CPU 713 examines events in the circular buffer after it 
is transferred to the DRAM 714. 

The Media Switch 701 has a data bus 711 that connects to the CPU 713 and 
DRAM 714. An address bus 712 is also shared between the Media Switch 701 , 
CPU 713, and DRAM 714. A hard disk or storage device 710 is connected to one 
of the ports of the Media Switch 701 . The Media Switch 701 outputs streams to an 
MPEG video decoder 715 and a separate audio decoder 717. The audio decoder 
717 signals contain audio cues generated by the system in response to the user's 
commands on a remote control or other intemal events. The decoded audio output 
from the MPEG decoder is digitally mixed 718 with the separate audio signal. The 
resulting signals contain video, audio, and on-screen displays and are sent to the TV 
716. 

The Media Switch 701 takes in 8-bit data and sends it to the disk, while at the same 
time extracts another stream of data off of the disk and sends it to the MPEG 
decoder 715. All of the DMA engines described above can be worthing at the 
same time. The Media Switch 701 can be implemented in hardware using a Field 
Programmable Gate Array (FPGA). ASIC, or discrete logic. 

Rather than having to parse through an immense data stream looking for the start of 
where each frame would be, the program logic only has to look at the circular event 
buffer in DRAM 714 and it can tell where the start of each frame is and the frame 
type. This approach saves a large amount of CPU power, keeping the real time 
requirements of the CPU 713 small, The CPU 713 does not have to be very fast 
at any point in time. The Media Switch 701 gives the CPU 713 as much time as 
possible to complete tasks. The parsing mechanism 705 and event queue 708 
decouple the CPU 713 from parsing the audio, vWeo, and buffers and the real time 
nature of the streams, which allows for lower costs. It also allows the use of a bus 
structure in a CPU environment that operates at a much lower clock rate with much 
cheaper memory than would be required othenArise. 

The CPU 713 has the ability to queue up one DMA transfer and can set up the next 
DMA transfer at its leisure. This gives the CPU 713 large time inten/als within whfch 
it can service the DMA controller 709. The CPU 713 may respond to a DMA 
intenrupt within a larger time window because of the large latency allowed. MPEG 
streams, whether extracted from an MPEG2 Transport or encoded from an analog 
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TV signal, are typically encoded using a technique called Variable Bit Rate encoding 
(VBR). This technique varies the amount of data required to represent a sequence 
of images by the amount of movement between those images. This technique carf 
greatly reduce the required bandwidth for a signal, however sequences with rapid 
movement (such as a basketball game) may be encoded with much greater 
bandwidth requirements. For example, the Hughes DirecTV satellite system 
encodes signals with anywhere from 1 to 10Mb/s of required bandwidth, varying 
from frame to frame. It would be difficult for any computer system to keep up with 
such rapidly varying data rates without this structure. 

With respect to Fig. 8, the program logic within the CPU has three conceptual 
components: sources 801, transfomns 802, and sinks 803. The sources 801 
produce buffers of data. Transforms 802 process buffers of data and sinks 803 
consume buffers of data. A transform is responsible for allocating and queuing the 
buffers of data on whteh it will operate. Buffers are allocated as if ^'emptf to sources 
of data, which give them back "full". The buffers are then queued and given to sinks 
as iuir, and the sink will retum the buffer "empty". 

A source 801 accepts data from encoders, e.g., a digital satellite receiver. It acquires 
buffers for this data from the downstream transform, packages the data into a buffer, 
then pushes the buffer down the pipeline as described above. The source object 
801 does not know anything about the rest of the system. The sink 803 consumes 
buffers, taking a buffer from the upstream transform, sending the data to the decoder, 
and then releasing the buffer for reuse. 

There are two types of transforms 802 used: spatial and temporal. Spatial 
transforms are transforms that perform, for example, an image convolution or 
compression/decompression on the buffered data that is passing through. 
Temporal transforms are used when there is no time relation that is expressible 
between buffers going in and buffers coming out of a system. Such a transform 
writes the buffer to a file 804 on the storage medium. The buffer is pulled out at a 
later time, sent down the pipeline, and properly sequenced within the stream. 

Refening to Fig. 9, a C++ class hierarchy derivation of the program logic is shown. 
The TIVo Media Kernel (Tmk) 904, 908, 913 mediates with the operating system 
kemel. The kemel provides operations such as: memory allocation, synchronization, 
and threading. The TmkCore 904, 908, 913 structures memory taken from the 
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media kernel as an object. It provides operators, new and delete, for constructing 
and deconstructing the object. Each object (source 901. transform 902, and sink 
903) is multi-threaded by definitkxi and can mn in parallel. ' 

The TmkPipeline class 905, 909, 914 is responsible for flow control through the 
system. The pipelines point to the next pipeline in the flow from source 901 to sink 
903. To pause the pipeline, for example, an event called "pause" is sent to the first 
object in the pipeline. The event is relayed on to the next object and so on down 
the pipeline. This all happens asynchronously to the data going through the 
pipeline. Thus, similar to applications such as telephony, control of the flow of 
MPEG streams is asynchronous and separate from the streams themselves. This 
allows for a simple logic design that is at the same time powerful enough to support 
the features described previously, including pause, rewind, fast fon^/ard and others, 
in addition, this stmcture allows fast and efficient switching between stream sources, 
since buffered data can be simply discarded and decoders reset using a single 
event, after which data from the new stream will pass down the pipeline. Such a 
capability is needed, for example, when switching the channel being captured by 
the input section, or when switching between a live signal from the input section and 
a stored stream. 

The source object 901 is a TmkSource 906 and the transfomn object 902 is a 
TmkXfmi 91 0. These are intermediate classes that define standard behavtors for the 
classes in the pipeline. Conceptually, they handshake buffers down the pipeline. 
The source object 901 takes data out of a physical data source, such as the Media 
Switch, and places it into a PES buffer. To obtain the buffer, the source object 901 
asks the down stream object in his pipeline for a buffer (allocEmptyBuf). The source 
object 901 is blocked until there is sufftoient memory. This means that the pipeline is 
self-regulating; it has automatic flow control. When the source object 901 has filled 
up the buffer, it hands it back to the transform 902 through the pushFulBuf function. 

The sink 903 is flow controlled as well. It calls nextPullBuf which tells the transform 
902 that it is ready for the next filled buffer This operation can block the sink 903 
until a buffer is ready. When the sink 903 is finished with a buffer (i.e., it has 
consumed the data in the buffer) it calls releaseEmptyBuf. ReleaseEmptyBuf 
gives the buffer back to the transfomn 902. The transfomi 902 can then hand that 
buffer, for example, back to the source object 901 to fill up again. In addition to the 
automatic flow-control benefit of this method, it also provides for limiting the amount 
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of memory dedicated to buffers by allowing enforcement of a fixed allocation of 
buffers by a transform. This is an important feature in achieving a cost-effective 
limited DRAM environment. 

The MediaSwrtch class 909 calls the allocEmptyBuf method of the TmkCllpCache 
912 object and receives a PES buffer from it . It then goes out to the circular buffers 
in the Media Switch hardware and generates PES buffers. The MediaSwitch class 
909 fills the buffer up and pushes it back to the TmkClipCache 912 object. 

The TmkClipCache 912 maintains a cache file 918 on a storage medium. It also 
maintains two pointers into this cache: a push pointer 91 9 that shows where the next 
buffer coming from the source 901 is inserted; and a cunBnt pointer 920 which points 
to the cun^ent buffer used. 

The buffer that is pointed to by the current pointer is handed to the Vela decoder 
class 916. The Vela decoder class 916 talks to the decoder 921 in the hardware. 
The decoder 921 produces a decoded TV signal that is subsequently encoded into 
an analog TV signal in NTSC, PAL or other analog fomiat. When the Vela decoder 
class 91 6 is finished with the buffer it calls releaseEmptyBuf. 

The structure of the classes makes the system easy to test and debug. Each level 
can be tested separately to make sure it performs iri the appropriate manner, arid 
the classes may be gradually aggregated to achieve the desired functionality while 
retaining the ability to effectively test each object. 

The control object 917 accepts commands from the user and sends events into the 
pipeline to control what the pipeline is doing. For example, if the user has a remote 
control and is watching TV, the user presses pause and the control object 917 sends 
an event to the sink 903, that tells it pause. The sink 903 stops asking for new 
buffers. The current pointer 920 stays where it is at. The sink 903 starts taking 
buffers out again when it receives another event that tells it to play. The system is in 
perfect synchronizatbn; it starts from the frame that it stopped at. 

The remote control may also have a fast fon^^ard key. When the fast fonward key is 
pressed, the control object 917 sends an event to the transform 902, that tells it to 
move forward two seconds. The transform 902 finds that the two second time span 
requires it to move fon^/ard three buffers. It then issues a reset event to the 
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downstream pipeline, so tliat any queued data or state that may be present in the 
hardware decoders is flushed. This is a critical step, since the stmcture of MPEG 
streams requires maintenance of state across multiple frames of data, and that state * 
will be rendered invalid by repositioning the pointer. It then moves the current 
pointer 920 fon^^ard three buffers. The next time the sink 903 calls nextFullBuf it 
gets the new current buffer. The same method works for fast reverse in that the 
transfomi 902 moves the current pointer 920 backwards. 

A system clock reference resides in the decoder. The system clock reference is 
sped up for fast play or slowed down for slow play. The sink simply asks for full 
buffers faster or slower, depending on the clock speed. 

With respect to Fig. 10. two other objects derived from the TmkXfmn class are 
placed in the pipeline for disk access. One is called TmkClipReader 1003 and the 
other is called TmkClipWriter 1001 . Buffers come into the TmkClipWriter 1001 and 
are pushed to a file on a storage medium 1004. TmkClipReader 1003 asks for 
buffers which are taken off of a file on a storage medium 1005. A TmkClipReader 
1003 provides only the allocEmptyBuf and pushFullBuf methods, while a 
TmkClipWriter 1001 provides only the nextFullBuf and releaseEmptyBuf methods. 
A TmkClipReader 1003 therefore performs the same function as the input, or "push" 
side of a TmkClipCache 1002, while a TmkClipWriter 1001 therefore performs the 
same function as the output, or "puir side of a TmkClipCache 1002. 

Refening to Fig. 11 , a preferred embodiment that accomplishes multiple functions is 
shown. A source 1101 has a TV signal input. The source sends data to a 
PushSwitch 1 102 which is a transform derived from TmkXfrm. The PushSwitch 
1 102 has multiple outputs that can be switched by the control object 1114. This 
means that one part of the pipeline can be stopped and another can be started at 
the users whim. The user can switch to different storage devices. The PushSwitch 
1102 could output to a TmkClipWriter 1106, which goes onto a storage device 
1 1 07 or write to the cache transform 1 1 03. 

An important feature of this apparatus is the ease with which it can selectively capture 
portions of an incoming signal under the control of program logic. Based on 
infomiation such as the curent time, or pertiaps a specific time span, or pertiaps via 
a remote control button press by the viewer, a TmkClipWriter 1106 may be 
switched on to record a portion of the signal, and switched off at some later time. 
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This switching is typically caused by sending a "switch" event to the PushSwitch 
1102 object. 

An additional method for triggering selective capture is through information 
modulated into the VBI or placed into an MPEG private data channel. Data 
decoded from the VBI or private data channel is passed to the program logic. The 
program logic examines this data to determine If the data indicates that capture of the 
TV signal into which it was modulated should begin. Similarly, this infomiation may 
also indicate when recording should end, or another data item may be modulated 
into the signal indicating when the capture should end. The starting and ending 
indicators may be explicitly modulated into the signal or other information that is 
placed into the signal in a standard fashion may be used to encode this infomfiation. 

With respect to Fig. 12. an example is shown which demonstrates how the program 
logic scans the words contained within the closed caption (CC) fields to determine 
starting and ending times, using particular words or phrases to trigger the capture. A 
stream of NTSC or PAL fields 1201 is presented. CC bytes are extracted from 
each odd field 1202, and entered in a circular buffer 1203 for processing by the 
Word Parser 1204. The Word Parser 1204 collects characters until it encounters a 
word boundary, usually a space, period or other delineating character. Recall from 
above, that the MPEG audio and video segments are collected into a series of 
fixed-size PES buffers. A special segment is added to each PES buffer to hold the 
words extracted from the CC field 1205. Thus, the CC information is preserved in 
time synchronization with the audio and video, and can be correctly presented to the 
viewer when the stream is displayed. This also allows the stored stream to be 
processed for CC infonmation at the leisure of the program logic, which spreads out 
load, reducing cost and improving efficiency. In such a case, the words stored in the 
special segment are simply passed to the state table logic 1206. 

During stream capture, each word is looked up in a table 1206 which indicates the 
action to take on recognizing that word. This action may simply change the state of 
the recognizer state machine 1207, or may cause the state machine 1207 to issue an 
action request, such as "start capture", "stop capture", "phrase seen", or other similar 
requests. Indeed, a recognized word or phrase may cause the pipeline to be 
switched; for example, to overtay a different audio track if undesirable language is 
used in the program. 



15 



wo 00/07368 



PCT/US99/04894 



Note that the parsing state table 1206 and recognizer state machine 1207 may be 
modified or changed at any time. For example, a different table and state machine 
may be provided for each input channel. Alternatively, these elements may bef* 
switched depending on the time of day, or because of other events. 

Refening to Fig. 1 1 , a PuilSwitch is added 11 04 which outputs to the sink 1 105. The 
sink 1 105 calls nextFullBuf and releaseEmptyBuf to get or retum buffers from the 
PuilSwitch 1 104. The PuilSwitch 1 104 can have any number of inputs. One input 
could be an ActionClip 1 1 13. The remote control can switch between input sources. 
The control object 1114 sends an event to the PuilSwitch 1 1 04, telling rt to switch. It 
will switch from the cun-ent input source to whatever input source the control object 
selects. 

An ActionClip class provides for sequencing a number of different stored signals in a 
predictable and controllable manner, possibly with the added control of viewer 
selection via a remote control. Thus, it appears as a derivative of a TmkXfmi object 
that accepts a ''switch" event for switching to the next stored signal. 

This allows the program logic or user to create custom sequences of video output. 
Any number of video segments can be lined up and combined as if the program 
logic or user were using a broadcast studio video mixer. TmkClipReaders 1108, 
1109, 1110 are allocated and each is hooked into the PuilSwitch 1104. The 
PuilSwitch 1104 switches between the TmkClipReaders 1108. 1109, 1110 to 
combine video and audio clips. Flow control is automatic because of the way the 
pipeline is constructed. The Push and Pull Switches are the same as video switches 
in a broadcast studio. 

The derived class and resulting objects described here may be combined In an 
arbitrary way to create a number of different useful configurations for storing, 
retrieving, switching and viewing of TV streams. For example, if multiple input and 
output sections are available, one input is viewed while another is stored, and a 
picture-in-picture window generated by the second output is used to preview 
previously stored streams. Such configurations represent a unique and novel 
application of software transfonnations to achieve the functionality expected of 
expensive, sophisticated hardware solutions within a single cost-effective device. 
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With respect to Fig. 13. a high-level system view is shown which implements a 
VCR backup. The Output Module 1303 sends TV signals to the VCR 1307. This 
allows the user to record TV programs directly on to video tape. The invention* 
allows the user to queue up programs from disk to be recorded on to video tape 
and to schedule the time that the programs are sent to the VCR 1307. Title pages 
(EPG data) can be sent to the VCR 1307 before a program is sent. Longer 
programs can be scaled to fit onto smaller video tapes by speeding up the play 
speed or dropping frames. 

The VCR 1307 output can also be routed back into the Input Module 1301 . In this 
configuration the VCR acts as a backup system for the Media Switch 1302, Any 
overflow storage or lower priority programming is sent to the VCR 1307 for later 
retrieval. 

The Input Module 1301 can decode and pass to the remainder of the system 
infonnation encoded on the Vertical Blanking Interval (VBI). The Output Module 
1303 can encode into the output VBI data provided by the remainder of the 
system. The program logic may arrange to encode identifying infonnation of various 
kinds into the output signal, which will be recorded onto tape using the VCR 1307. 
Playing this tape back into the input allows the program logic to read back this 
identifying infomnation, such that the TV signal recorded on the tape is properiy 
handled. For example, a particular program may be recorded to tape along with 
information about when it was recorded, the source network, etc. When this program 
is played back into the Input Module, this information can be used to control storage 
of the signal, presentation to the viewer, etc. 

One skilled in the art will readily appreciate that such a mechanism may be used to 
introduce various data items to the program logic whteh are not properiy conceived 
of as television signals. For instance, software updates or other data may be 
passed to the system. The program logic receiving this data from the television 
stream may impose controls on how the data is handled, such as requiring certain 
authentteation sequences and/or decrypting the embedded infonnation according to 
some previously acquired key. Such a method worics for normal broadcast signals 
as well, leading to an efficient means of providing non-TV control information and 
data to the program logic. 
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Additionally, one skilled in tlie art will readily appreciate that although a VCR is 
specifically mentioned above, any multimedia recording device (e.g.. a Digital Video 
Disk-Random Access Memory (DVD-RAM) recorder) is easily substituted in it^ 
place. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appreciate that other applteations may 
be substituted for those set forth herein without departing from the spirit and scope 
of the present invention. For example, the invention can be used in the detection of 
gambling casino crime. The input section of the invention is connected to the casino's 
video surveillance system. Recorded video is cached and simultaneously output to 
external VCRs. The user can switch to any video feed and examine (/.e., rewind, 
play, slow play, fast forward, etc.) a specific segment of the recorded video while 
the extemal VCRs are being loaded with the real-time input video. Accordingly, the 
invention should only be limited by the Claims included below. 
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CLAIMS 

1 . A process for the simultaneous storage and play back of multimedia data, 
comprising the steps of: 

accepting television (TV) broadcast signals, wherein said TV signals are 
based on a multitude of standards, including, but not limKed to, National Television 
Standards Committee (NTSC) broadcast, PAL broadcast, satellite transmission, 
DSS. DBS, or ATSC; 

tuning said TV signals to a specific program; 

providing at least one Input Section, wherein said input section converts said 
specific program to an Moving Pictures Experts Group (MPEG) fomiatted stream 
for internal transfer and manipulation; 

providing a Media Switch, wherein said Media Switch parses said MPEG 
stream, said MPEG stream is separated into its video and audio components; 

storing said video and audio components on a storage device; 

providing at least one Output Section, wherein said Output Section extracts 
said video and audio components from a storage device; 

wherein said Output Section assembles said video and audio components 
into an MPEG stream; 

wherein said Output Section sends said MPEG stream to a decoder 

wherein said decoder converts said MPEG stream into TV output signals; 

wherein said decoder delivers said TV output signals to a TV receiver 

wherein the storing and extracting of said video and audio components from 
said storage device can be perfomfied simultaneously; and 

accepting control commands from a user, wherein said control commands are 
sent through the system and affect the flow of said MPEG stream. 

2. The process of claim 1 , wherein said Input Section directs said MPEG stream 
to the destination indicated by said control commands. 

3. The process of claim 1 , wherein said Output Section extracts said video and 
audio components from the storage device indicated by said control commands. 

4. The process of claim 1 , further comprising the step of: 

creating custom video output sequences, wherein said sequences are 
specified by a user or program control. 
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5. The process of claim 1 , wherein said Media Switch calculates time stamp 
values or extracts time stamp values from a digital TV stream and logically* 
associates said time stamp values to said video and audio components. 

6. The process of claim 1 , further comprising the steps of: 
placing said video component into a circular video buffer 
placing said audio component into a circular audio buffer, 

posting an event in a circular event buffer, wherein said event contains an 
indication that a video or audio component was found and the location of said video 
or audio component in said circular video or audio buffer; 

sending notice of said event posting; 

receiving said notice; 

retrieving said event posting from said event buffer; 

indexing into the appropriate buffer indicated by the type and location 
infomnation in said event buffer; and 

generating a buffer containing the logical audio or video segments in order, 
including ancillary information, wherein each of said logical segments points to the 
appropriate circular buffer location where conresponding audio or video components 
have been placed. 

7. The process of claim 1 , further comprising the step of: 

increasing or decreasing the decoder system clock rate for fast playback, fast 
reverse playback, slow playback, or slow reverse. 

8. The process of claim 1 , further comprising the step of: 

combining system audio cues and on-screen displays with said TV output 
signals. 

9. The process of claim 1 , further comprising the steps of: 

decoding the Vertical Blanking Inten/al (VBI) data or private data channel 
information from said TV signal; and 

examining said data to determine the starting or ending indicators of a specific 
program. 
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1 0. The process of claim 1 , further comprising the step of: 

scanning the words contained within the closed caption (CC) fields to 
determine program starting and ending times, wherein particular words or phrases* 
are used to trigger the recording of a specific program and wherein the CC 
information is presen/ed in time synchronization with the audio and video, and can be 
correctly presented to the viewer when the stream Is displayed; and 

perfomiing a specific action when a specific word Is found In said CC 
information. 

1 1 . The process of claim 1 , wherein said Media Switch operates asychronously 
and autonomously with a CPU and wherein said Media Switch allows the CPU to 
queue up Direct Memory Access (DMA) transfers. 

1 2. The process of claim 1 , further comprising the step of: 

providing a multimedia recording device, including, but not limited to, a Video 
Cassette Recorder (VCR) or a Digital Video Disk-Random Access Memory (DVD- 
RAM) device, wherein said recording device is attached to the output side of said 
decoder, allowing said user to record said TV output signals; 

wherein said user queues up programs from said disk to be stored on said 
recording device; 

wherein said user sets time schedules for said programs to be sent to said 
recording devtee; 

wherein title pages may be sent to said recording device before sending a 
program to be stored on said recording device; 

wherein a program that is longer in duration than a magnetic tape in said 
recording device allows, is sped up to fit within the desired time limit or has frames 
dropped from it to fit within the desired time limit, and 

wherein the output of said recording device is routed to said input section, 
allowing said recording device to act as a storage back up system, said recording 
device accepts overflow storage, TV programs, software updates, or other data that 
are later retrieved and sent to said input sectk>n. 

13. A process for the simultaneous storage and play back of multimedia data, 
comprising the steps of: 

providing a physical data source, wherein said physical data source accepts 
broadcast data from an input device, parses video and audio data from said 
broadcast data, and temporarily stores said video and audio data; 
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providing a source object, wherein said source object extracts video and 
audio data from said physical data source; 

providing a transform object, wherein said transfomn object stores and* 
retrieves MPEG streams onto a storage device; 

wherein said source object obtains a buffer from said transform object, said 
source object converts video data into MPEG streams and fills said buffer with said 
streams; 

wherein said source object is automatically flow controlled by said transfomn 
object; 

providing a sink object, wherein said sink object obtains MPEG stream 
buffers from said transform object and outputs said streams to a video and audio 
decoder; 

wherein said decoder converts said streams into TV signals and sends said 

signals to a TV receiven 

wherein said sink object is automatically flow controlled by said transform 

object; 

providing a control object, wherein said control object receives commands 
from a user, said commands control the flow of the broadcast data through the 
system; and 

wherein said control object sends flow command events to said source, 
transfomi, and sink objects. 

14. An apparatus for the simultaneous storage and play back of multimedia data, 
oomprising: 

a module for accepting television (TV) broadcast signals, wherein said TV 
signals are based on a multitude of standards, including, but not limited to, National 
Television Standards Committee (NTSC) broadcast, PAL broadcast, satellite 
transmission. DSS, DBS. or ATSC; 

a module for tuning said TV signals to a specific program; 

a module for providing at least one Input Section, wherein said input section 
converts said specific program to an Moving Pictures Experts Group (MPEG) 
formatted stream for internal transfer and manipulation; 

a module for providing a Media Switch, wherein said Media Switch parses 
said MPEG stream, said MPEG stream is separated into its video and audio 
components; 

a module for storing said video and audio components on a storage device; 
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a module for providing at least one Output Section, wherein said Output 
Section extracts said video and audio components from a storage device; 

wherein said Output Section assembles said video and audio components* 
into an MPEG stream; 

wherein said Output Section sends said MPEG stream to a decoder; 

wherein said decoder converts said MPEG stream into TV output signals; 

wherein said decoder delivers said TV output signals to a TV receiver 

wherein the storing and extracting of said video and audio components from 
said storage device can be performed simultaneously; and 

a module for accepting control commands from a user, wherein said control 
commands are sent through the system and affect the flow of said MPEG stream. 

15. The apparatus of claim 14, wherein said Input Section directs said MPEG 
stream to the destination indicated by said control commands. 

1 6. The apparatus of claim 1 4, wherein said Output Section extracts said 
video and audio components from the storage device indicated by said control 
commands. 

17. The apparatus of claim 1 4. further comprising: 

a module for creating custom video output sequences, wherein said 
sequences are specified by a user or program control. 

1 8. The apparatus of claim 1 4, wherein said Media Switch calculates time stamp 
values or extracts time stamp values from a digital TV stream and logically 
associates said time stamp values to said video and audio components. 

1 9. The apparatus of claim 1 4, further comprising: 

a module for placing said video component into a circular video buffer; 

a module for placing said audio component into a circular audio buffer, 

a module for posting an event in a circular event buffer, wherein said event 
contains an Indication that a video or audio component was found and the location of 
said video or audio component in said circular video or audio buffer, 

a module for sending notice of said event posting; 

a module for receiving said notice; 

a module for retrieving said event posting from said event buffer; 
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a module for indexing into the appropriate buffer indicated by the type and 
location information in said event buffer; and 

a module for generating a buffer containing the logical audio or video* 
segments in order, including ancillary information, wherein each of said logical 
segments points to the appropriate circular buffer location where corresponding 
audio or video components have been placed. 

20. The apparatus of claim 1 4, further comprising: 

a module for increasing or decreasing the decoder system clock rate for fast 
playback, fast reverse playback, slow playback, or slow reverse. 

21 . The apparatus of claim 1 4, further comprising: 

a module for combining system audio cues and on-screen displays with said 
TV output signals. 

22. The apparatus of claim 1 4, further comprising: 

a module for decoding the Vertical Blanking Intense! (VBI) data or private data 
channel information from sakJ TV signal; and 

a module for examining said data to detemnine the starting or ending indicators 
of a specific program. 

23. The apparatus of daim 14, further comprising: 

a module for scanning the words contained within the closed caption (CC) 
fields to detemriine program starting and ending times, wherein particular words or 
phrases are used to trigger the recording of a specific program and wherein the CC 
information is preserved in time synchronization with the audio and vkteo. and can be 
conBctly presented to the viewer when the stream is displayed; and 

a module for perfomiing a specific action when a spedffc word is found in said 
CC information. 

24. The apparatus of claim 14, wherein said Media Switch operates 
asychronously and autonomously with a CPU and wherein said Media Switch allows 
the CPU to queue up Direct Memory Access (DMA) transfers. 

25. The apparatus of claim 14. further comprising: 

a multimedia recording device, including, but not limited to. a Video Cassette 
Recorder (VCR) or a Digital Video Disk-Random Access Memory (DVD-RAM) 
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device, wherein said recxDrding device is attached to the output side of said decoder, 
allowing said user to record said TV output signals; 

wherein said user queues up programs from said disk to be stored on said^ 
recording device; 

wherein said user sets time schedules for said programs to be sent to said 
recording device; 

wherein title pages may be sent to said recording device before sending a 
program to be stored on said recording device; 

wherein a program that is longer in duration than a magnetic tape in said 
recording device allows, is sped up to fit within the desired time limit or has frames 
dropped from it to fit within the desired time limit; and 

wherein the output of said recording device is routed to said input section, 
allowing said recording device to act as a storage back up system, said recording 
device accepts overflow storage, TV programs, software updates, or other data that 
are later retrieved and sent to said input section. 

26. An apparatus for the simultaneous storage and play back of multirtiedia data, 
comprising: 

a physical data source, wherein said physical data source accepts broadcast 
data from an input device, parses video and audio data from said broadcast data, 
and temporarily stores said video and audk) data; 

a source object, wherein said source object extracts video and audio data 
from said physical data source; 

a transform object, wherein said transform object stores and retrieves MPEG 
streams onto a storage device; 

wherein said source object obtains a buffer from said transform object, said 
source object converts video data into MPEG streams and fills said buffer with said 
streams; 

wherein said source object is automatically flow controlled by said transform 

object; 

a sink object, wherein said sink object obtains MPEG stream buffers from 
said transfomi object and outputs said streams to a video and audio decoder; 

wherein said decoder converts said streams into TV signals and sends said 
signals to a TV receiver, 

wherein said sink object is automatically flow controlled by said transfomrt 

object; 
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a control object, wherein said control object receives commands from a user, 
said commands control the flow of the broadcast data through the system; and 

wherein said control object sends flow command events to said source,^ 
transform, and sink objects. 
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